Phrasier: An Interactive System for Linking and Browsing Within Document Collections Using Keyphrases

Author

  • Steve Jones
Abstract

When documents are collected together from diverse sources they are unlikely to contain useful hypertext links to support browsing amongst them. Manual or semi-automated link creation is often infeasibly time-consuming for large document collections. We present Phrasier, an interactive system which automatically introduces links to related material into documents as the user browses and queries a digital library collection. Suitable links are identified using keyphrases found within the document text, and they support both topic-based and inter-document navigation. Previews of link destinations are provided to reduce unproductive link traversals, and important segments of document text are identified and highlighted to support skimming of viewed documents. Evaluation has shown that Phrasier's keyphrase-based linking mechanism produces sparse hypertexts, although similar documents tend to have short paths between them. A study using human assessors in a simulated document retrieval task indicated that the generated links are perceived to be useful and relevant.
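
The idea of linking documents through shared keyphrases can be illustrated with a minimal sketch. It is not Phrasier's actual implementation, which the abstract does not describe in detail. It assumes that keyphrases have already been extracted for each document, and it ranks candidate link targets simply by the number of keyphrases they share with the current document. The function names `build_index` and `related_documents` and the sample data are hypothetical.

```python
from collections import defaultdict


def build_index(doc_keyphrases):
    """Map each (lower-cased) keyphrase to the set of documents containing it."""
    index = defaultdict(set)
    for doc_id, phrases in doc_keyphrases.items():
        for phrase in phrases:
            index[phrase.lower()].add(doc_id)
    return index


def related_documents(doc_id, doc_keyphrases, index):
    """Rank other documents by how many keyphrases they share with doc_id.

    Assumption: overlap count is used as the relatedness score; a real
    system may weight phrases differently.
    """
    shared = defaultdict(set)
    for phrase in doc_keyphrases[doc_id]:
        key = phrase.lower()
        for other in index[key]:
            if other != doc_id:
                shared[other].add(key)
    # Most shared keyphrases first; ties broken by document id.
    return sorted(shared.items(), key=lambda item: (-len(item[1]), item[0]))


# Hypothetical sample collection: document id -> extracted keyphrases.
docs = {
    "a": ["digital library", "hypertext links", "keyphrase extraction"],
    "b": ["keyphrase extraction", "digital library", "machine learning"],
    "c": ["machine learning", "noun phrases"],
}
index = build_index(docs)
links = related_documents("a", docs, index)
for target, phrases in links:
    print(target, sorted(phrases))
```

Each shared keyphrase in the result could serve as a link anchor in the viewed document, with the ranked documents as candidate destinations. Documents with no keyphrase in common receive no link, which is consistent with the sparse hypertexts reported in the abstract.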

Similar resources

Design and Evaluation of Phrasier, an Interactive System for Linking Documents Using Keyphrases

When documents are collected together from diverse sources they are unlikely to contain useful hypertext links to support browsing amongst them. Manual, or semi-automated link creation is often infeasibly time-consuming for large document collections. We present Phrasier, an interactive system which automatically introduces links to related material into documents as the user browses and querie...

Link as You Type: Using Key Phrases for Automated Dynamic Link Generation

When documents are collected together from diverse sources they are unlikely to contain useful hypertext links to support browsing amongst them. For large collections of thousands of documents it is prohibitively resource intensive to manually insert links into each document. Users of such collections may wish to relate documents within them to text that they are themselves generating. This pro...

Finding nuggets in documents: A machine learning approach

However, many text mining applications do not have adequate natural language processing ability beyond simple keyword indexing, and as a result, there are too many textual elements (words) included in the analysis. We argue that noun phrases as textual elements are better suited for text mining and could provide more discriminating power, than single words. Discourse representation theory (Kamp...

Interactive Demo: Stay in Touch with InfoVis – Visualizing Document Collections with Document Cards

Large document collections are essential resources for a wide variety of professionals, like scientists, lawyers, analysts, etc. An electronic document management system can assist them in solving the tedious tasks of curating, browsing, searching, and recognizing documents in these collections. As an initial step in creating such a system, we invented the Document Cards [3] as a mixed image-te...

Cross-language Entity Linking Adapting to User’s Language Ability

In this paper, we propose a method to automatically discover valuable keyphrases in Japanese and link these keyphrases to related Chinese Wikipedia pages. The method that we propose has four stages. Firstly, we extract nouns from a Japanese document using a morphological analyzer and extract the candidates of keyphrases using a method called Top Consecutive Nouns Cohesion (TCNC) [1]. Then, we j...

Publication date: 1999